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4 


<110> 


APPLICANT : 


Matsui, 


Ikuo 
















IE 


5 






Ishikawa, 


Kazuhiko 




















6 






Ishida 


, Hiroyasu 






















7 






Kosugi 


, Yoshitsugu 




















9 


<120> 


TITLE 


OF INVENTION 


: THERMOPHILIC ENZYMES HAVING 






1 0 






BETA-GLYCOSIDASE ACTIVITY 


















1 9 


<130> 


FILE REFERENCE 


: 11059/002001 
















1 4 


<140> 


CURRENT APPLICATION NUMBER: 09/369^356 










1 

J. *j 


<141> 


CURRENT FILING 


DATE: 1999- 


08-06 














1 7 

JL I 


<160> 


NUMBER 


OF 


SEQ 


ID NOS: 


10 


















1 Q 


<170> 


SOFTWARE : 


FastSEQ 


for 


Windows 


Version 


4.0 










21 


<210> 


SEQ ID 


NO: 


1 
























99 


<211> 


LENGTH 


: 1269 
























9 1 


<212> 


TYPE: 


DNA 


























94 


<213> 


ORGANISM: 


Pyrococcus horikoshii 














u 


<220> 


FEATURE : 


























97 


<221> 


NAME/KEY : 


CDS 
























9ft 


<222> 


LOCATION: 


(1).. 


. . (1269) 




















•J VJ 


<400> 


SEQUENCE : 


1 
























J -L 


atg 


ccg 


ctg 


aaa 


ttc 


ccg 


gaa 


atg 


ttt 


etc 


ttt 


ggt 


acc 


gca 


aca 


tea 


19 


Met 


Pro 


Leu 


Lys 


Phe 


Pro 


Glu 


Met 


Phe 


Leu 


Phe 


Gly 


Thr 


Ala 


Thr 


Ser 


1 1 


1 










5 










10 










15 




IS 


tec 


cat 


cag 


ata 


gag 


gga 


aat 


aat 


aga 


tgg 


aat 


gat 


tgg 


tgg 


tac 


tat 


16 


Ser 


His 


Gin 


He 


Glu 


Gly 


Asn 


Asn 


Arg 


Trp 


Asn 


Asp 


Trp 


Trp 


Tyr 


Tyr 


17 










20 










25 










30 






19 


gag 


cag 


att 


gga 


aag 


etc 


ccc 


tac 


aga 


tct 


ggt 


aag 


get 


tgc 


aat 


cac 




Glu 


Gin 


He 


Gly 


Lys 


Leu 


Pro 


Tyr 


Arg 


Ser 


Gly 


Lys 


Ala 


Cys 


Asn 


His 


41 








35 










40 










45 








4 1 


tgg 


gaa 


ctt 


tac 


agg 


gat 


gat 


att 


cag 


eta 


atg 


acc 


age 


ttg 


ggc 


tat 


44 


Trp 


Glu 


Leu 


Tyr 


Arg 


Asp 


Asp 


lie 


Gin 


Leu 


Met 


Thr 


Ser 


Leu 


Gly 


Tyr 


45 




50 










55 










60 










47 


aat 


get 


tat 


agg 


ttc 


tec 


ata 


gag 


tgg 


age 


agg 


eta 


ttc 


cca 


gag 


gaa 


48 


Asn 


Ala 


Tyr 


Arg 


Phe 


Ser 


He 


Glu 


Trp 


Ser 


Arg 


Leu 


Phe 


Pro 


Glu 


Glu 


49 


65 












70 










75 










80 


51 


aat 


aaa 


ttt 


aat 


gaa 


gat 


get 


ttc 


atg 


aaa 


tac 


egg 


gag 


att 


ata 


gac 


52 


Asn 


Lys 


Phe 


Asn 


Glu 


Asp 


Ala 


Phe 


Met 


Lys 


Tyr 


Arg 


Glu 


He 


He 


Asp 


53 












85 










90 










95 




55 


ttg 


tta 


ttg 


acg 


aga 


ggt 


ata 


act 


ccc 


ctg 


gtg 


acc 


eta 


cac 


cac 


ttt 


56 


Leu 


Leu 


Leu 


Thr 


Arg 


Gly 


He 


Thr 


Pro 


Leu 


Val 


Thr 


Leu 


His 


His 


Phe 


57 










100 










105 










110 






59 


act 


age 


cct 


etc 


tgg 


ttc 


atg 


aag 


aaa 


ggt 


ggc 


ttc 


ctt 


agg 


gag 


gag 


60 


Thr 


Ser 


Pro 


Leu 


Trp 


Phe 


Met 


Lys 


Lys 


Gly 


Gly 


Phe 


Leu 


Arg 


Glu 


Glu 


61 








115 










120 










125 








63 


aac 


eta 


aaa 


cat 


tgg 


gaa 


aag 


tac 


ata 


gaa 


aag 


gtt 


get 


gag 


ctt 


tta 


64 


Asn 


Leu 


Lys 


His 


Trp 


Glu 


Lys 


Tyr 


He 


Glu 


Lys 


Val 


Ala 


Glu 


Leu 


Leu 


65 




130 










135 










140 










67 


gaa 


aaa 


gtt 


aaa 


eta 


gta 


get 


acc 


ttc 


aat 


gag 


ccg 


atg 


gta 


tac 


gta 



T, 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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68 Glu Lys Val Lys Leu Val. Ala Thr Phe Asn Glu Pro Met Val Tyr Val 

69 145 150 155 160 

71 atg atg gga tat eta acg get tat tgg ccc cca ttc att agg agt cca 

72 Met Met Gly Tyr Leu Thr Ala Tyr Trp Pro Pro Phe lie Arg Ser Pro 

73 165 170 175 

75 ttt aag gec ttt aag gta. get gca aac ctg ctt aaa get cac gca att 576 

76 Phe Lys Ala Phe Lys Val Ala Ala Asn Leu Leu Lys Ala His Ala lie 

77 180 185 190 

79 gec tat gaa ctt ctt cat ggg aaa ttc aaa gtt gga ate gta aag aat 

80 Ala Tyr Glu Leu Leu His Gly Lys Phe Lys Val Gly lie Val Lys Asn 

81 195 200 205 

83 att ccc ata ata etc cca gcg agt gac aag gag agg gat aga aaa gec 

84 He Pro He He Leu Pro Ala Ser Asp Lys Glu Arg Asp Arg Lys Ala 

85 210 215 220 

87 get gag aaa get gat aat tta ttt aac tgg cac ttt ttg gat gcg ata 

88 Ala Glu Lys Ala Asp Asn Leu Phe Asn Trp His Phe Leu Asp Ala He 

89 225 230 235 240 

91 tgg agt ggg aaa tac aga ggg gta ttt aaa aca tat agg att ccc caa 

92 Trp Ser Gly Lys Tyr Arg Gly Val Phe Lys Thr Tyr Arg He Pro Gin 

93 245 250 255 

95 agt gac gca gat ttc att ggg gtt aac tat tac acg gec age gaa gta 

96 Ser Asp Ala Asp Phe He Gly Val Asn Tyr Tyr Thr Ala Ser Glu Val 

97 260 265 270 

99 agg cat act tgg aat cct tta aaa ttc ttc ttt gag gtg aaa tta gcg 

100 Arg His Thr Trp Asn Pro Leu Lys Phe Phe Phe Glu Val Lys Leu Ala 

101 275 280 285 

103 gat att age gag agg aag act caa atg gga tgg age gtt tat cca aaa 

104 Asp He Ser Glu Arg Lys Thr Gin Met Gly Trp Ser Val Tyr Pro Lys 

105 290 295 300 

107 gga ata tac atg gec ctt aaa aaa get tec agg tat gga agg cct ctt 

108 Gly He Tyr Met Ala Leu Lys Lys Ala Ser Arg Tyr Gly Arg Pro Leu 

109 305 310 315 320 

111 tat att acg gaa aac gga ata gcg acg ctt gat gat gaa tgg aga gtg 

112 Tyr He Thr Glu Asn Gly He Ala Thr Leu Asp Asp Glu Trp Arg Val 

113 325 330 335 

115 gaa ttc ata att caa cac etc caa tac gtt cat aag get ate gaa gac 

116 Glu Phe He He Gin His Leu Gin Tyr Val His Lys Ala He Glu Asp 

117 340 345 350 

119 ggc ctg gat gta aga ggt tac ttc tat tgg tea ttt atg gat aac tac 

120 Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

121 355 360 365 

123 gag tgg aaa gag ggg ttt ggg cct aga ttt ggc eta gtg gaa gtt gat 1152 

124 Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

125 370 375 380 

127 tat caa acc ttc gag aga agg ccc agg aag agt get tac gta tac gga . 1200 

128 Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 

129 385 390 395 400 

131 gaa att gca aga agt aag gaa ata aag gat gag eta tta aag aga tat 

132 Glu He Ala Arg Ser Lys Glu He Lys Asp Glu Leu Leu Lys Arg Tyr 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1248 
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133 










405 










410 






415 




135 


ggc 


eta 


cca 


gaa 


ctt 


caa 


ctt 
















136 


Gly 


Leu 


Pro 


Glu 


Leu 


Gin 


Leu 
















137 








420 






















140 


<210> SEQ ID NO: 


2 




















141 


<211> LENGTH: 423 




















142 


<212> TYPE: 


PRT 






















143 


<213> ORGANISM: 


Pyrococcus horikoshii 










145 


<400> SEQUENCE: 


2 




















146 


Met 


Pro 


Leu 


Lys 


Phe 


Pro 


Glu 


Met 


Phe 


Leu Phe Gly Thr- Ala 


Thr 


Ser 


147 


1 






5 










10 






15 




148 


Ser 


His 


Gin 


He 


Glu 


Gly 


Asn 


Asn 


Arg 


Trp Asn Asp 


Trp 


Trp 


Tyr 


Tyr 


149 








20 










25 






30 






150 


Glu 


Gin 


He 


Gly 


Lys 


Leu 


Pro 


Tyr 


Arg 


Ser Gly Lys 


Ala 


Cys 


Asn 


His 


151 






35 










40 






45 








152 


Trp 


Glu 


Leu 


Tyr 


Arg 


Asp 


Asp 


He 


Gin 


Leu Met Thr Ser Leu Gly Tyr 


153 




50 










55 






60 










154 


Asn 


Ala 


Tyr 


Arg 


Phe 


Ser 


He 


Glu 


Trp 


Ser Arg Leu 


Phe 


Pro 


Glu 


Glu 


155 


65 










70 








75 








80 


156 


Asn 


Lys 


Phe 


Asn 


Glu 


Asp 


Ala 


Phe 


Met 


Lys Tyr Arg 


Glu 


He 


He 


Asp 


157 










85 










90 






95 




158 


Leu 


Leu 


Leu 


Thr 


Arg 


Gly 


He 


Thr 


Pro 


Leu Val Thr 


Leu 


His 


His 


Phe 


159 








100 










105 






110 






160 


Thr 


Ser 


Pro 


Leu 


Trp 


Phe 


Met 


Lys 


Lys Gly Gly Phe 


Leu 


Arg 


Glu 


Glu 


161 






115 










120 






125 








162 


Asn 


Leu 


Lys 


His 


Trp 


Glu 


Lys 


Tyr 


He 


Glu Lys Val 


Ala 


Glu 


Leu 


Leu 


163 




130 










135 






140 










164 


Glu 


Lys 


Val 


Lys 


Leu 


Val 


Ala 


Thr 


Phe 


Asn Glii Pro 


Met 


Val 


Tyr 


Val 


165 


145 








150 








155 








160 


166 


Met 


Met 


Gly 


Tyr 


Leu 


Thr 


Ala 


Tyr 


Trp 


Pro Pro Phe 


He 


Arg 


Ser 


Pro 


167 










165 










170 






175 




168 


Phe 


Lys 


Ala 


Phe 


Lys 


Val 


Ala 


Ala 


Asn 


Leu Leu Lys 


Ala 


His 


Ala 


He 


169 








180 










185 






190 






170 


Ala 


Tyr 


Glu 


Leu 


Leu 


His 


Gly Lys 


Phe Lys Val Gly 


He 


Val 


Lys 


Asn 


171 






195 










200 






205 








172 


He 


Pro 


He 


He 


Leu 


Pro 


Ala 


Ser 


Asp 


Lys Glu Arg 


Asp 


Arg 


Lys 


Ala 


173 




210 










215 






220 










174 


Ala 


Glu 


Lys 


Ala 


Asp 


Asn 


Leu 


Phe 


Asn 


Trp His Phe 


Leu 


Asp 


Ala 


He 


175 


225 










230 








235 








240 


176 


Trp 


Ser 


Gly 


Lys 


Tyr 


Arg 


Gly Val 


Phe 


Lys Thr Tyr Arg 


He 


Pro 


Gin 


177 










245 










250 






255 




178 


Ser 


Asp 


Ala 


Asp 


Phe 


He 


Gly Val 


Asn 


Tyr Tyr Thr 


Ala 


Ser 


Glu 


Val 


179 








260 










265 






270 






180 


Arg 


His 


Thr 


Trp 


Asn 


Pro 


Leu 


Lys 


Phe 


Phe Phe Glu 


Val 


Lys 


Leu 


Ala 


181 




275 










280 






285 








182 


Asp 


He 


Ser 


Glu 


Arg 


Lys 


Thr 


Gin 


Met Gly Trp Ser Val Tyr 


Pro 


Lys 


183 




290 










295 






300 










184 


Gly 


He 


Tyr 


Met 


Ala 


Leu 


Lys 


Lys 


Ala 


Ser Arg Tyr Gly Arg 


Pro 


Leu 


185 


305 










310 








315 








320 



1269 
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186 Tyr lie Thr Glu Asn Gly lie Ala Thr Leu Asp Asp Glu Trp Arg Val 

187 325 330 335 

188 Glu Phe lie lie Gin His Leu Gin Tyr Val His Lys Ala He Glu Asp 

189 340 345 350 

190 Gly Leu Asp Val Arg Gly Tyr Phe Tyr Trp Ser Phe Met Asp Asn Tyr 

191 355 360 365 

192 Glu Trp Lys Glu Gly Phe Gly Pro Arg Phe Gly Leu Val Glu Val Asp 

193 370 375 380 

194 Tyr Gin Thr Phe Glu Arg Arg Pro Arg Lys Ser Ala Tyr Val Tyr Gly 

195 385 390 395 400 

196 Glu He Ala Arg Ser Lys Glu He Lys Asp Glu Leu Leu Lys Arg Tyr 

197 405 410 415 

198 Gly Leu Pro Glu Leu Gin Leu 

199 420 

201 <210> SEQ ID NO: 3 

202 <211> LENGTH: 57 

203 <212> TYPE: DNA 

204 <213> ORGANISM: Artificial Sequence^ 

206 <220> FEATURE: 

207 <223> OTHER INFORMATION: An upper primer designed to create the Ndel site 

209 <400> SEQUENCE: 3 

210 taagaaggag atatacatat gccgctgaaa ttcccggaaa tgtttctctt tggtacc 57 

212 <210> SEQ ID NO: 4 

213 <211> LENGTH: 46 

214 <212> TYPE: DNA 

215 <213> ORGANISM: Artificial Sequence 

217 <220> FEATURE: 

218 <223> OTHER INFORMATION: A lower primer designed to create the BamHI site 

220 <400> SEQUENCE: 4 

221 tttactgcag agaggatccc taatcctaaa gttgaagttc tggtag 46 

223 <210> SEQ ID NO: 5 

224 <211> LENGTH: 423 

225 <212> TYPE: PRT 

226 <213> ORGANISM: Pyrococcus horikoshii 
228 <400> SEQUENCE: 5 




229 


Met 


Pro 


Leu 


Lys 


Phe 


Pro 


Glu 


Met 


Phe 


Leu 


Phe 


Gly 


Thr 


Ala 


Thr 


Ser 


230 


1 








5 










10 










15 




231 


Ser 


Lys 


Cys 


He 


Glu 


Gly 


Asn 


Asn 


Arg 


Trp 


Asn 


Cys 


Trp 


Trp 


Tyr 


Tyr 


232 








20 










25 










30 






233 


Glu 


Gin 


He 


Gly 


Lys 


Leu 


Pro 


Tyr 


Arg 


Ser 


Gly 


Lys 


Ala 


Cys 


Asn 


His 


234 






35 










40 










45 








235 


Trp 


Glu 


Leu 


Tyr 


Arg 


Asp 


Asp 


He 


Gin 


Leu 


Met 


Thr 


Ser 


Leu Gly Tyr 


236 




50 










55 










60 










237 


Asn 


Ala 


Tyr 


Arg 


Phe 


Ser 


He 


Glu 


Trp 


Ser 


Arg 


Leu 


Phe 


Pro 


Glu 


Glu 


238 


65 










70 










75 










80 


239 


Asn 


Lys 


Phe 


Met 


Glu 


Asp 


Ala 


Phe 


Met 


Lys 


Tyr 


Arg 


Glu 


He 


He 


Asp 


240 










85 










90 










95 




241 


Leu 


Leu 


Leu 


Thr 


Phe 


Gly 


He 


Thr 


Pro 


Leu 


Val 


Thr 


Leu 


His 


His 


Phe 


242 








100 










105 










110 
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243 


Thr 


Ser 


Pro 


Leu 


Trp 


Phe 


Met 


Lys 


Lys 


Gly Gly 


Phe 


Leu 


Arg 


Glu 


Glu 


244 






115 










120 










125 








245 


Asn 


Leu 


Lys 


His 


Trp 


Glu 


Lys 


Tyr 


He 


Glu 


Lys 


Val 


Ala 


Glu 


Leu 


Leu 


246 




130 










135 










140 










247 


Glu 


Lys 


Val 


Lys 


Leu 


Val 


Ala 


Thr 


Phe 


Asn 


Glu 


Pro 


Met 


Val 


Tyr 


Val 


248 


145 










150 










155 










160 


249 


Met 


Met 


Gly 


Tyr 


Leu 


Thr 


Ala 


Tyr 


Trp 


Pro 


Pro 


Phe 


He 


Arg 


Ser 


Pro 


250 










165 










170 










175 




251 


Phe 


Lys 


Ala 


Phe 


Lys 


Val 


Ala 


Ala 


Asn 


Leu 


Leu 


Lys 


Ala 


His 


Ala 


He 


252 








180 










185 










190 






253 


Ala 


Tyr 


Glu 


Leu 


Leu 


His 


Gly 


Lys 


Phe 


Lys 


Val 


Gly 


He 


Val 


Lys 


Asn 


254 






195 










200 










205 








255 


lie 


Pro 


He 


He 


Leu 


Pro 


Ala 


Ser 


Asp 


Lys 


Glu 


Arg 


Asp 


Arg 


Lys 


Ala 


256 




210 










215 










220 










257 


Ala 


Glu 


Lys 


Ala 


Asp 


Asn 


Leu 


Phe 


Asn 


Trp 


His 


Phe 


Leu 


Asp 


Ala 


He 


258 


225 










230 










235 










240 


259 


Trp 


Ser; Gly Lys 


Tyr 


Arg 


Gly 


Val 


Phe 


Lys 


Thr 


Tyr 


Arg 


He 


Pro 


Gin 


260 










245 










250 










255 




261 


Ser 


Asp 


Ala 


Asp 


Phe 


He 


Gly 


Met 


Asn 


Tyr 


Tyr 


Thr 


Ala 


Ser 


Glu 


Val 


262 








260 










265 










270 






263 


Arg 


His 


Thr 


Trp 


Asn 


Pro 


Leu 


Lys 


Phe 


Phe 


Phe 


Glu 


Val 


Lys 


Leu 


Ala 


264 






275 










280 










285 








265 


Asp 


He 


Ser 


Glu 


Arg 


Lys 


Thr 


Gin 


Met 


Gly 


Trp 


Ser 


Val 


Tyr 


Pro 


Lys 


266 




290 










295 










300 










267 


Gly 


He 


Tyr Met 


Ala 


Leu 


Lys 


Lys 


Ala 


Ser 


Pro 


Tyr Gly Arg 


Pro 


Leu 


268 


305 










310 










315 










320 


269 


Tyr 


He 


Thr 


Glu 


Asn 


Gly 


He 


Ala 


Thr 


Leu 


Asp 


Asp 


Glu 


Trp 


Arg 


Val 


270 










325 










330 










335 




271 


Glu 


Phe 


He 


He 


Gin 


His 


Leu 


Gin 


Tyr 


Val 


His 


Lys 


Ala 


He 


Glu 


Asp 


272 








340 










345 










350 






273 


Gly 


Leu 


Asp 


Val 


Arg 


Gly 


Tyr 


Phe 


Tyr 


Trp 


Ser 


Phe 


Met 


Asp 


Asn 


Tyr 


274 






355 










360 










365 








275 


Glu 


Trp 


Lys 


Glu 


Gly 


Phe 


Gly 


Pro 


Arg 


Phe Gly 


Leu 


Val 


Glu 


Val 


Asp 


276 




370 










375 










380 










277 


Tyr 


Gin 


Thr 


Phe 


Glu 


Arg 


Arg 


Pro 


Arg 


Lys 


Ser 


Ala 


Tyr 


Val 


Tyr 


Gly 


278 


385 










390 










395 










400 


279 


Glu 


He 


Ala 


Arg 


Ser 


Lys 


Glu 


He 


Lys 


Asp 


Glu 


Leu 


Leu 


Lys 


Arg 


Tyr 


280 










405 










410 










415 




281 


Gly 


Leu 


Pro 


Glu 


Leu 


Gin 


Leu 




















282 








420 



























284 <210> SEQ ID NO: 6 

285 <211> LENGTH: 483 

286 <212> TYPE: PRT 

287 <213> ORGANISM: Pyrococcus horikoshii 

289 <400> SEQUENCE: 6 

290 Met Lys Phe Tyr Trp Gly Val Val Gin Ser Ala Phe Gin Phe Glu Met 

291 15 10 15 

292 Gly Asp Pro Tyr Arg Arg Asn He Asp Pro Arg Ser Asp Trp Trp Tyr 

293 20 25 30 
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